Efficient Beam Thresholding for Statistical Machine Translation
Abstract
Beam thresholding is a widely used pruning approach in decoding algorithms for statistical machine translation. In this paper, we propose two variations on conventional beam thresholding, both of which speed up decoding without degrading the BLEU score. The first variation is dynamic beam thresholding, in which the beam threshold varies with the length of the source sequence covered by a hypothesis. The second incorporates a language model look-ahead probability into beam thresholding so that the interaction between a hypothesis and the context outside it can be captured. Both thresholding methods achieve significant speed improvements when used separately. Combining them yields a further speedup, comparable to that of the cube pruning approach (Chiang, 2007). Experiments also show that dynamic beam thresholding can further improve cube pruning.
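The two ideas in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no formulas, so the `Hypothesis` fields, the log-domain margin `beam_width`, the way the look-ahead estimate is added to the score, and the linear schedule in `linear_width` (including its direction and default values) are all assumptions chosen for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Hypothesis:
    score: float            # log-probability of the partial translation
    lookahead: float = 0.0  # assumed LM look-ahead estimate (log-probability)


def prune(hyps: List[Hypothesis], beam_width: float,
          use_lookahead: bool = False) -> List[Hypothesis]:
    """Beam thresholding: keep hypotheses whose score lies within
    beam_width (a log-domain margin) of the best hypothesis.
    With use_lookahead=True the comparison uses score + lookahead,
    which is one assumed way to fold in outside context."""
    if not hyps:
        return []

    def key(h: Hypothesis) -> float:
        return h.score + (h.lookahead if use_lookahead else 0.0)

    best = max(key(h) for h in hyps)
    return [h for h in hyps if key(h) >= best - beam_width]


def linear_width(span_len: int, source_len: int,
                 start: float = 1.0, end: float = 3.0) -> float:
    """Hypothetical dynamic schedule: interpolate the beam width
    linearly from `start` (span length 1) to `end` (full sentence).
    The shape and direction of the schedule are assumptions."""
    if source_len <= 1:
        return end
    frac = (span_len - 1) / (source_len - 1)
    return start + (end - start) * frac


# Usage: prune one cell of hypotheses covering 4 of 10 source words.
cell = [Hypothesis(-1.0, -3.0), Hypothesis(-2.5, -0.5), Hypothesis(-4.0, -0.2)]
width = linear_width(span_len=4, source_len=10)
survivors = prune(cell, width, use_lookahead=True)
```

Passing a fixed `beam_width` to `prune` gives the conventional scheme; computing the width per span with a schedule such as `linear_width` gives a dynamic variant, and `use_lookahead=True` ranks hypotheses by their score plus the look-ahead estimate.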
Similar resources
An Efficient A* Search Algorithm for Statistical Machine Translation
In this paper, we describe an efficient A* search algorithm for statistical machine translation. In contrast to beam search or greedy approaches, it is possible to guarantee the avoidance of search errors with A*. We develop various sophisticated admissible and almost admissible heuristic functions. Especially our newly developed method to perform a multi-pass A* search with an iteratively impro...
The Correlation of Machine Translation Evaluation Metrics with Human Judgement on Persian Language
Machine Translation Evaluation Metrics (MTEMs) are the central core of Machine Translation (MT) engines as they are developed based on frequent evaluation. Although MTEMs are widespread today, their validity and quality for many languages are still in question. The aim of this research study was to examine the validity and assess the quality of MTEMs from Lexical Similarity set on machine tra...
Word Reordering and a Dynamic Programming Beam Search Algorithm for Statistical Machine Translation
In this article, we describe an efficient beam search algorithm for statistical machine translation based on dynamic programming (DP). The search algorithm uses the translation model presented in Brown et al. (1993). Starting from a DP-based solution to the traveling-salesman problem, we present a novel technique to restrict the possible word reorderings between source and target language in or...
Incremental Decoding for Phrase-Based Statistical Machine Translation
In this paper we focus on the incremental decoding for a statistical phrase-based machine translation system. In incremental decoding, translations are generated incrementally for every word typed by a user, instead of waiting for the entire sentence as input. We introduce a novel modification to the beam-search decoding algorithm for phrase-based MT to address this issue, aimed at efficient co...
Statistical Post-Editing for a Statistical MT System
Statistical post-editing (SPE) techniques have been successfully applied to the output of Rule Based MT (RBMT) systems. In this paper we investigate the impact of SPE on a standard Phrase-Based Statistical Machine Translation (PB-SMT) system, using PB-SMT both for the first-stage MT and the second stage SPE system. Our results show that, while a naive approach to using SPE in a PB-SMT pipeline ...
Publication date: 2009